Unitary gauge considered harmful 
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Tommy Anderbergj 
(Dated: January 2, 2009) 

I informally review the Higgs mechanism, focusing on fundamental aspects which should be com- 
mon knowledge but apparently are not, explain why your understanding of gauge symmetry breaking 
is a decreasing function of your reliance on the unitary gauge, and discuss some implications for 
cosmology (domains, dark energy, CMB cold spot). 

PACS numbers: 11.15.Ex,11.27.+d,12.10.-g,12.15.Ji,95.36.+x,98.80.-k,98.80.Cq 



I. INTRODUCTION 

Long ago, Dijkstra observed that "the quality of pro- 
grammers is a decreasing function of the density of go to 
statements in the programs they produce" and famously 
argued that gotos should be abolished from high level 
programming languages [l|. If you are old (or geeky) 
enough to have ever used gotos, you know why: while 
they can simplify the task at hand and even make pro- 
grams run faster, their repeated use quickly leads to 
"spaghetti code". That's a technical term for "incom- 
prehensible mess" . The short term efficiency gain is paid 
for with a loss of understanding, which inevitably ends 
up causing errors. 

In recent years, I have observed an analogous phe- 
nomenon in physics. Initially, I thought that it only 
occurred in isolated and particularly unfortunate cases 
(after all, finding oneself talking to me could be consid- 
ered pretty unfortunate). As my sample has grown to in- 
clude distinguished practitioners (or so I'm told), I have 
reluctantly been forced to accept that it is not so. The 
problem is ubiquitous. 

With only months remaining before $10 billion worth 
of LHC start looking for the Higgs (or whatever actually 
breaks electroweak symmetry) it is my regrettable duty 
to inform you that many, probably even most of you do 
not understand gauge symmetry breaking. The culprit is 
none other than the "goto" of particle physics: the uni- 
tary gauge. Like the goto statement, the unitary gauge 
does simplify some tasks (proving unitarity, estimating 
mass spectra, introducing students to gauge symmetry 
breaking), but only at the cost of understanding. This 
loss of understanding is causing errors, some of them po- 
tentially spectacular. 



II. CLASSICAL THEORY 
A. Goldstone model 

The standard textbook example of spontaneous sym- 
metry breaking (SSB) is the Goldstone model Q, defined 



by the Lagrangian 

with "Mexican hat" potential 



y(0) = A 
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where (/) = (j){x) is a complex Lorentz scalar field and A, 
v are real, positive parameters. V{(f>) has a degenerate 
"valley" of connected minima at (?!)t(?!) = i^^/2, i.e. a circle 
in the complex plane, and a local maximum at 0^0 = 0. 

Solving the full equations of motion may be too hard 
for us, but we can at least obtain approximate solutions 
using perturbation theory. To do so, we must pick a sta- 
ble point (0) about which small oscillations in will re- 
main small. That means a point in the valley. Since gra- 
dient energy is minimized when (j){x) has the same value 
for all spacetime coordinates x, any constant </> in the 
valley minimizes total energy and therefore qualifies as a 
ground state ("vacuum") of £(0). The circle 4>^4> — v"^ /2 
is therefore known as the vacuum manifold of £(</>) . It has 
a global U(l) ~ 0(2) symmetry, i.e. it's invariant under 
rotations of in the complex plane. Picking a partic- 
ular vacuum breaks this symmetry. This is the essence 
of SSB: the theory (meaning the Lagrangian), has more 
symmetry than any individual ground state. 

Let's arbitrarily choose the point 



<«^7! 



(3) 



on the real axis as the starting point for our perturba- 
tive expansion. To parameterize perturbations about it, 
introduce orthogonal field coordinates 4>i (along the real 
axis) and (j)2 (along the imaginary axis): 



</> = (0) + 



01 + l(P2 

V2 

1 +i'f>2 



V2 
In terms of (pi and (j)2 , the Lagrangian is 



(4) 
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C{^uh) = 2(^M'^i)' + ^id^c^^f - 1/^(01, 02) (5) 



with potential 

V(*l,*2) 
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Note the first term: it gives (j)i a mass squared 2\v^ . 
There is no term quadratic in 02, so (j)2 is massless. 

The reason for this difference is that (/)i describes dis- 
placements against the restoring force of the potential, 
while (/>2 describes displacements along the flat potential 
valley. The generalization of this observation is Gold- 
stone's theorem: spontaneous breaking of a continuous 
symmetry implies the existence of a massless field (a 
Goldstone boson; in supersymmetric theories, there are 
also Goldstone fermions - goldstinos - to match) for each 
flat direction. In mathematical terms, the number of 
such Goldstone bosons is the dimension of the coset space 
G/H, where G is the full symmetry group of the vacuum 
manifold and H is the subgroup of G under which the 
vacuum remains invariant (if any) . 

The parametrization of Eq. ^ using Cartesian field 
coordinates 4>i and (j)^ is the natural choice for a pertur- 
bative expansion, but ill suited to exploring £ beyond 
the infinitesimal neighborhood of a particular vacuum, 
i.e. to finding non-perturbative solutions. For that, it's 
more convenient to use curvilinear coordinates which fol- 
low the geometry of the vacuum manifold. In the simple 
case of the Goldstone model, this means standard polar 
coordinates: 



(0) 



V2 



V + P^^e 



V2 
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The leading terms of a Taylor expansion of 4> in the radial 
and angular displacements p and 6 are 



v + p + iv9 



(8) 



By comparison with Eq. ([4]) , for small perturbations 

d^i ^ P (9) 

02 ^ vO (10) 

(02 parameterizes the tangent of the vacuum manifold at 
= (0)). In terms of p and 0, the Lagrangian is 



-v{p) 

with ^-independent potential 

V{p) = \[p{v + p/2)] 

^4 



{p + vf{dMd'e) 
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(11) 
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from which we again read off a mass squared 2\v^ for 
the radial excitation (p), while 9 remains massless. 



B. Higgs mechanism 

The Goldstone model is turned into the simplest exam- 
ple of the Higgs mechanism Q [3] by upgrading its global 
U(l) symmetry to a local U(l) symmetry. The Gold- 
stone Lagrangian is invariant under an identical rotation 
of 0(a;) at every spacetime point x; the Higgs Lagrangian 
is invariant under an independent rotation at each space- 
time point. 

To construct it, introduce an Abelian gauge field Af^{x) 
with field strength tensor 



i* III' — OnAiy — U^Aij 



and Lagrangian 



Ca^--F^uF>'-' 



(13) 



(14) 



and substitute ordinary derivatives with covariant deriva- 
tives 



9. 



Df, = d^ + igA^ 



(15) 



{g is the gauge couphng constant) in £(0). The result- 
ing total Lagrangian is invariant under the simultaneous 
local transformations 



0(a;) -> 0'(a.) 



-iuj{x) 



0(x) 



Af,{x) -^ A'^{x) = Af,{x) + -d^Lu{x) 



(16) 
(17) 



The variation in the gauge field Afj_{x) exactly compen- 
sates the variation in 0(x), i.e. if we apply the transfor- 
mations of Eqs. (|16[) and P7|) to the total Lagrangian 
and work through the algebra, we end up with exactly 
the same expression, apart from the trivial substitutions 



0(x) 
A,.{x) 



0'(x) 
A',{x) 



(18) 
(19) 



This freedom to apply a local transformation implies that 
there are fewer independent equations of motion than 
degrees of freedom. The problem is of course the gauge 
field. Its Lagrangian is identical to that of a photon, with 
in the role of a charged scalar, so what we have here 
is just scalar electrodynamics with an unusual T^(0). We 
know that photons have only two independent degrees 
of freedom, the transverse polarization states, but A^(x) 
has four. As in electrodynamics, we must therefore sup- 
plement the equations of motion with a gauge fixing con- 
dition before we can actually compute anything. 

Following the standard textbook approach, we post- 
pone that decision and choose a vacuum first. Choos- 
ing again (0) = i^/V2 and expanding {d ^(j))'^ {d^ (j)) -^ 



{D^(b)HD'^(b) yields 

+igA^{{^^^q^)^-cj)^{^^'cj))) 
In Cartesian coordinates, the new terms are 



(20) 
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In polar coordinates, they are 



{giyfA^A^ 

+gA^p{d^e){2:y + p) 

^g^A^A''p{2iy + p)/2 

+gv^Af,dfe 



(22) 



The first term tells us that the gauge field A^ has picked 
up a mass squared 2{gv)'^. The remaining terms, except 
for the last one, describe interactions involving yl^, (pi ~ 
p (massive as in the Goldstone case) and 02 ^ vO (still 
massless). 

The last term is problematic: it's quadratic in the 
fields, like a mass term, but it mixes ^^ and (/)2 ~ vO, sug- 
gesting that they are not independent. Counting degrees 
of freedom confirms this. We started out with a com- 
plex scalar (two real components) and a massless vector 
field (two more) for a total of four degrees of freedom. 
We ended up with two scalars and a massive vector field, 
which also has a longitudinal polarization state, for a 
total of 2 + 3 = 5 degrees of freedom. Since a simple 
change of coordinates can not affect the number of de- 
grees of freedom, one of them is redundant. 

This is where your favorite introductory field theory 
textbook points out that (/)2 ~ v9 can be made to van- 
ish at every spacetime point x using the invariance of 
the total Lagrangian under the simultaneous local trans- 
formations of Eqs. (fTB)) and P7|) . In polar coordinates, 
this is trivially easy to see: simply set the transforma- 
tion parameter u!{x) equal to the angular displacement 
0{x). The transformed field (t)'{x) will then be real, i.e. 
6' ~ vB' = 0: 



cj,'{x) 



-ie(x] 



<j>ix) 



V2 



_ u + p{x) 

Provided that we also perform the transformation 

A^{x) ^ A!Ax) = A^(x) + -d^d(x) 



(23) 



(24) 



the only effect on the Lagrangian, expressed in terms of 
(/) and A^^ , is to adorn the fields with primes according to 
Eqs. (fT8l)-(fT9|. 

We conclude that gauge invariance allows us to impose 
the condition 



Q{x) = ^ 02(a;) = 



(25) 



so as to remove the extra degree of freedom (known as 
a "would-be" Goldstone boson) and leave us with a La- 
grangian written exclusively in terms of a massive A^ (x) 
(colloquially said to have "eaten" the would-be Goldstone 
boson in order to acquire a longitudinal component) and 
a massive <i>\{x) ~ p(a;). The latter is the simplest ex- 
ample of a Higgs boson. The condition of Eq. (P5|) (and 
its equivalents for larger symmetry groups) is the unitary 
gauge. 



C. Can (maybe), not must! 

The fact that we can impose the unitary gauge does 
not imply that we must do so, of course. There is a liter- 
ally infinite number of conditions which may legitimately 
be used to remove the redundant degree of freedom, and 
you are free to use whichever is most convenient. Physical 
observables like energy density are by definition indepen- 
dent of this choice, but if you wish to directly compare 
solutions obtained in different gauges, all you have to do 
is transform them to a common gauge. 

The main requirement on a gauge condition is that 
it be reversible: given an arbitrary field configuration 
(A^(x), (/)(a;)), a transformation must exist such that 
the transformed fields satisfy the condition and can be 
uniquely transformed back to the original configuration 
(see e.g. p. 7 in [5|). This ensures that no information is 
lost by applying the condition. If this requirement is not 
satisfied, strictly speaking the condition is not a gauge 
condition, but an unphysical constraint. It may still be 
useful in special situations, if it only cuts out a part of 
configuration space disjoint from that of the configura- 
tions under consideration, but it can not be used in full 
generality. 

It is not hard to see that the unitary "gauge" is in fact 
such an unphysical constraint: it fails to be reversible at 
(\){x) = 0. 

A famous example of the solutions living in the null 
space of the unitary gauge (its "blind spot" ) is provided 
by the topologically stable vortices first described by 
Nielsen and Olesen [6]. Such a vortex is characterized 
by a non-trivial map from polar field coordinates (p, ff) 
to polar spatial coordinates (r, kp) 



imp 



and satisfies 



0(r,(p) = p{r)e 



lim drp{r) ~ 

r — ^oo 

lim p(r) — 

r— >0 



(26) 

(27) 
(28) 



Extend it to a cylinder and you get a flux tube (or 
"string", not to be confused with fundamental ones) 
known to cosmologists as a cosmic string. The "winding 
number" n is an integer (the Pontryagin index) specifying 
the number of times (j) goes around the potential valley 
as you walk along a single loop about spatial origin. The 
stability argument is simple: changing n requires the field 
to be lifted out of the potential valley and slid over the 
top of the Mexican hat, at an energy density cost oc Xv'^. 
If you try to transform a Nielsen-Olesen vortex to the 
unitary gauge throughout all space, you will inevitably 
run into trouble with Eq. ([M)) at r = 0: 6 \s, undefined 
there, so A' will be undefined too. The singularity at 
(t){x) — blinds the unitary gauge to this kind of so- 
lution. Nielsen and Olesen instead used the time-axial 
gauge Aq = 0, and in so doing established the gauge of 
choice for non-perturbative work iri gauge field theories 

with ssB HiililPEipElIil 1. 

The lesson here is that while SSB is an intrinsically 
non-perturbative phenomenon, the unitary gauge is all 
about perturbation theory. There is nothing wrong with 
the Higgs Lagrangian expressed in terms of the original 
field, i.e. before shifting field coordinates to a particular 
vacuum. The shift's only purpose is to allow the use of 
perturbation theory. If you are not after a perturbative 
expansion, skip the shift and use a standard gauge con- 
dition for the gauge field(s). If losing the convenience of 
manifest Lorentz covariance is not a concern, a simple 
condition like the radiation gauge V • A = or an axial 
gauge is all you need. 



D. Non-Abelian Higgs mechanism 

There are two ways to summarize the construction of 
the Abelian Higgs model. You could say that we take 
the Goldstone model and gauge its U(l) symmetry (as 
we did above) or you could say that we take an Abelian 
gauge field and "Higgs" it. Either way, the procedure is 
readily generalized to the non-Abelian case. 

Consider a generic Yang-Mills (i.e. non-Abelian) gauge 
field theory [ig| (henceforth simply YM), defined by the 
Lagrangian 



l-W — .^afii^^a 



(29) 



where the field strength tensor Faf^, and gauge field Wafj, 
now carry the group index a 



Fn 



d^^Wau - d.Wa^, +gCabcWb^Wc, 



(30) 



and C'abc are the totally antisymmetric structure con- 
stants of some simple Lie group, such that representation 



^ It would be interesting to compare notes with these authors. 
Were they too, at some point, summarily dismissed with claims 
that "the true dynamics is given by the unitary gauge"? 



matrices Ta satisfy 

[Ta , Tb] — i CabcTc 



(31) 



To preserve gauge invariance, any matter field ^(a;) 
added to the Lagrangian (fermion or scalar, at this point 
we don't care) and coupling to Wafi must transform ac- 
cording to 



^^ ^' = C/* 



with 



Uix) 



-iuJa{x)Ta 



(32) 



(33) 



and transformation parameters uJa{x). The covariant 
derivative 



D^=df,+ igWa^Ta = df, + igW^ 



(34) 



compensates the variation in terms containing Z)^^ (by 
making them transform like \1/, so that globally invariant 
combinations like '^^df^^ become locally invariant after 
the substitution d^ -^ D^) provided that the gauge fields 
undergo the simultaneous transformation 

= UWa^TaU^ + -{d^U)U^ (35) 

Note that we recover the Abelian case when T = 1. 

The smallest non-Abelian Lie group is SU(2), with 
structure constants 



L^ai 



be 



^abc 



(36) 



(the totally antisymmetric Levi-Civita symbol, with 
£i23 = !)■ lu the fundamental (spinorial) representation. 
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where Ti , T2 , T3 are the Pauli matrices 



Tl 



rn 1 1 




[0 


—i 




r 1 


n1 


1 


T2 = 


i 





T3 = 





-1 



(37) 



(38) 



A scalar field in this representation must transform as an 
SU(2) doublet, i.e. 



$ 



$' 



e 2" 



(39) 



The superscripts "-I-" and "0" are just labels (for now). 
To induce SSB, we simply substitute this $ doublet 
into the Goldstone Lagrangian, Eq. ([1]) , let d^ — > D^ and 
add the result to Cw ■ In terms of the real $ components 

</>!, </'2, (j)^ and 4>i 



^0 



63 + 104 



(40) 
(41) 



the (Higgs) vacuum manifold is now a 3-sphere defined 
by 

$t$ = (0i)2 + (02)2 + (03)2 ^ (0^)2 ^ ^2/2 (42) 

There is actually more than we bargained for here: the 
symmetry of a 3-sphere is 0(4) ^ SU(2)xSU(2), twice 
the size of the SU(2) which we wish to break. This choice 
is made in anticipation of quantization. Classically, any 
symmetric potential will do, but if we want the quantized 
model to be renormalizable, the potential can be at most 
quartic in the fields (see [ITI] for a systematic survey of 
SU(N) and 0(N) symmetry breaking patterns). 

To estimate the perturbative mass spectrum, repeat 
the steps followed in the Abelian case. Shift the origin of 
our $ coordinates to 



($) 



1 

V2 



(43) 



(i.e. let 00 -* iy/V2 + (j)o), expand the (£)^$)t(DA'$) 
term in the Higgs Lagrangian and read off the quadratic 
terms: 



5(?)^--"'" 






(44) 



All three gauge bosons have become massive, along with 
the radial $ component in the chosen vacuum; but as 
expected, there are also terms mixing gauge bosons and 
derivatives of the angular $ components. They can again 
be removed by transforming to the unitary gauge: pass 
to "polar" field coordinates 



$(x) 



1 



V2 



e3»»(^)^» 





v + p{x) 



(45) 



and set the transformation parameters uJa{x) of Eq. (|39p 
equal to the angular displacement 9{x). Just as in the 
Abelian case, the exponentials cancel, and we are left 
with the real, radial component 



*'<^' ^ 7i 





f + p{x) 



(46) 



The would-be Goldstone bosons are gone, "eaten" by the 
gauge bosons, which have in turn been transformed ac- 
cording to Eq. (P5|) . 

It should now be evident by inspection of Eq. ((33|) that 
the analogous procedure will work for any semisimple Lie 
group, i.e. for any product of simple Lie groups. His- 
torically, the electroweak sector of the Standard Model 
[l^[i^ was constructed this way, by Higgsing an exist- 
ing gauge invariant Lagrangian, Glashow's U(l)xSU(2) 
model [2011. 



E. Instantons and sphalerons 

As we have already seen, the unitary gauge does not 
tell the whole story about the Abelian Higgs model. In 
the non- Abelian case, it reveals even less. 



Consider again the Nielsen-Olesen vortex of Eq. (|26p . 
In topological terms, it is stable because a non-trivial 
map from polar spatial coordinates to polar field coor- 
dinates, i.e. from a circle (spatial infinity) to another 
circle (the vacuum manifold), can not be continuously 
deformed to the map from a circle to a single point; the 
maps belong to different homotopy classes. In the non- 
Abelian case, this stability argument need no longer hold. 
With more directions available in field space, there are 
more ways to deform a vortex without leaving the vac- 
uum manifold: if the latter has the topology of a sphere 
rather than of a circle (i.e. if it is simply connected), 
there is nothing preventing a loop on it to be contracted 
to a single point, so classical stability of vortex solutions 
is no longer guaranteed. Such solutions can still occur as 
so-called embedded defects (solutions of a model based 
on a subgroup of the symmetry group under consider- 
ation) [2ll][22| but to find out whether they are stable, 
you must study their dynamics in detail. Since <& going 
to zero at some point is a common property of defects, 
whether embedded or not, the unitary gauge is oblivious 
to their existence. 

The problem is compounded by the non-trivial vac- 
uum structure of YM theory, even without Higgs. Con- 
sider the gauge transformation of Eq. (|35p applied to the 
trivial vacuum solution Wa^ = 0. Since a gauge transfor- 
mation can not affect observable quantities (e.g. energy 
density) the resulting "pure gauge" configuration must be 
a vacuum solution too. But with a non-Abelian symme- 
try group at our disposal, we can clearly use Eq. ((33)) to 
create non-trivial maps between spacetime and internal 
(i.e. group) space: the smallest non-Abelian Lie group, 
SU(2), has three generators, so we have always at least 
one generator for each spatial dimension. For instance, 
with transformation parameters 



UJa{x) 



\/x^ 



(47) 



(where n is an arbitrary number and n is some inte- 
ger) Eq. (j33p maps each point U in group space to n 
points at spatial infinity (x^ — > 00). As with the Abelian 
vortices, two maps with different winding number n can 
not be continuously deformed into each other, i.e. they 
belong to different homotopy classes (unlike maps dif- 
fering merely by k). Substituting them into Eq. (j35p 
therefore yields topologically distinct vacua, aptly called 
n-vacua, separated by energy barriers in configuration 
space. Since n can be any integer, the vacuum of pure 
YM theory is infinitely degenerate. 

In the absence of other fields, transitions between n- 
vacua are classically forbidden. Upon quantization, tun- 
neling transitions (instantons) become possible in prin- 
ciple, but remain far below observable rates in practice 
dBl [111 [2^ Q [23] . When you introduce $ however, non- 
contractible loops [2a| and spheres [2§| appear in con- 
figuration space, implying the existence of saddle point 
(i.e. unstable) static solutions perched on top of the bar- 
riers, along the minimum-energy paths between neigh- 



boring n-vacua. Such solutions are known as sphalerons 
(Greek for "ready to fall"). Although they were first dis- 
covered in the SU(2) 0] and U(l)xSU(2) Higgs mod- 
els 8] [30] 31] (where Nambu's embedded vortices 1211 
were also retroactively recognized as sphalerons [32| 13311 ) . 
they have since turned up elsewhere too, from QCD [Sj] 
to Einstein- Yang-Mills theory (YM coupled to gravity) 
[35J [lOj , showing that the primary role of $ in enabling 
their emergence is to provide a finite energy scale for the 
barriers. In the standard electroweak model, this scale is 
^lOTeV. 

The discovery of sphalerons posed a serious challenge 
to cosmology. Transitions between electroweak n-vacua 
violate conservation of baryon number (B) , so in thermal 
equilibrium they erase any net baryon number introduced 
as an initial condition or created at higher energy scales 
(because the Boltzmann distribution gives equal weight 
to particles of equal mass, hence to baryons and anti- 
baryons). This is known as the baryon washout problem. 
In the hot early universe, sphaleron transitions kept a B- 
violating channel open all the way down to T ~ 100 GeV, 
so baryogenesis can not have happened much above the 
electroweak scale. Fortunately, under non-equilibrium 
conditions sphalerons may also provide an efficient mech- 
anism for the production of a net B [30| [33 [3^| . 

It should come as no great surprise that all these solu- 
tions feature points where <I> = 0, so they all live in the 
null space of the unitary gauge. The problem has been 
known for a long time: gauge groups are compact, but 
the unitary gauge insists on topological triviality ^] . A 
"unitary gauge-like" description of all physical degrees of 
freedom is therefore necessarily singular [40| [4l| . 



and write the transition amplitude between the vacuum 
in the infinitely far past and the vacuum in the infinitely 
far future as the path integral 

W[J] = (0, +oo|0, -oo) ^M j V^ S^+J"^) (49) 



where A/" is a normalization constant. The expectation 
value of $ can then be obtained by varying W[J] in J: 



6W[J] 
5J{x) 



= i(0, -|-cx)|$(a;)|0, -oo)j 



(50) 



In practice it's more convenient to work with the func- 
tional Z[J] (the generator of connected Green functions) 
satisfying 



W[J] = e*^["'l 



(51) 



The closest relative to a classical field is the semiclassical 
(or "mean") field 

^s.{.) = '-M - <"'+-''^(f)'"'— )^ (52) 



5J{x) 



(0, -Hoo|0, -oo) 



In the absence of sources, it reduces to the vacuum ex- 
pectation value (VEV) of $(a;); the condition for SSB in 
the quantum theory is therefore 



*sc(a;)j=o 7^ 



(53) 



If the functional dependence of (^sc{x) on J{x) is invert- 
ible, we can eliminate the explicit source dependence us- 
ing the functional Legendre transform 



III. QUANTUM THEORY 

Brief mention of instantons and baryon number non- 
conservation aside, everything we have done so far is 
strictly classical field theory, i.e. our fields have been 
ordinary functions. But as Feynman would say, "I'm not 
happy with all the analyses that go with just classical 
theory, because nature isn't classical, dammit" [42|. To 
make contact with the real world, we must quantize. In 
quantum field theory (QFT) this means turning the fields 
and their conjugate momenta into operators and impos- 
ing canonical commutation relations between them. Ob- 
servables are then obtained as expectation values. 



A. Effective action 

Following Feynman, we introduce the action functional 
for a set of fields $(a;) coupled to sources J{x) (all indices 
suppressed; each field has its own source) 



T[^sc] = Z[J] - (J$, 



»)^j 



{£ + J$) = / d-^x [C{x) + J{x)^{x)] 



(48) 



by which 



J{x) 



SJ<i>sc{x) 



(54) 



(55) 



Without interactions, this expression reduces to the clas- 
sical equations of motion for $. With interactions, there 
are corrections ex h due to quantum fluctuations about 
the classical trajectory. 

r["I>sc] is known as the effective action. It can be writ- 
ten as the multilocal expansion 



oo 



■ ^sc{xi)...<i>sc{Xn)r^"\xi, ...,Xr,) (56) 

where T^"''{xi, ...,Xn) are the proper vertices (one- 
particle irreducible Green functions) of the theory. Tay- 
lor expansion of ^scixk) for fc > 1 about xi yields the 
quasilocal form 



T[^sc] = (-F($,e)a^$.c5^$«e - V^($.c) + ...) (57) 



where the functions F{^sc) and V{^sc) summarize our 
ignorance and "..." stands for higher order derivative 
terms. V{^sc) is the effective potential If we require 
the vacuum to be translation invariant, the condition for 
SSB can be written 



W($.c) 



5$. 



(58) 



iS>^^=u^O 



i.e. the effective potential must have an asymmetric min- 
imum, just like its classical counterpart. 

Note that none of this involves perturbation theory. 
The shift of Eq. ([3]) is sometimes motivated with an anal- 
ogy to elementary quantum mechanics, by anticipating 
the quantization of small oscillations about the bottom of 
the potential and their identification with particles, but 
as you can see, nothing in the formalism requires it. You 
are free to quantize the original, unshifted Lagrangian, 
and if you want to study non-perturbative phenoma, like 
SSB, that is what you should do. The vacuum manifold is 
then counterintuitively characterized by a non-vanishing 
occupation number, i.e. the vacua are not empty, but 
rather filled with the condensate of Eq. ([55)1 . 



B. Gauge fields 

When gauge fields are included, we are again con- 
fronted with the redundacy which they bring about. The 
path integral in Eq. (|49)) sums over all configurations, 
including those related by gauge transformations, and 
so becomes even more divergent than usual. The solu- 
tion is to restrict the functional measure to only one rep- 
resentative from each gauge-equivalent class (Faddeev- 
Popov ansatz [43|). In practice, a factor det(M/)(5(/(x)) 
is introduced (x now denotes both gauge and, option- 
ally, matter fields). The function /(x) goes to only 
when the gauge condition is satisfied; Mf = Sf/Su; is the 
functional Jacobi matrix of /(x) which encodes its re- 
sponse to infinitesimal gauge transformations^. The re- 
quirement that the gauge condition be reversible^ trans- 
lates to det(M/) ^ 0. 

When Mf depends on the fields, it's convenient to 
write 



det Mf 



,Tr{lnMf) 



(59) 



^ See |44ll and Section 9.1 in |147| for an introduction to the geo- 
metric interpretation of the Faddeev-Popov ansatz in terms of a 
hyperplane which intersects each gauge orbit once. 

^ The Coulomb gauge famously fails to satisfy this requirement at 
the non-perturbative level in the non-Abelian case |45|| . and no 
gauge satisfies it when spacetime is a four-s phe re, i.e. the hyper- 
sphere of five-dimensional Euclidean space |4al . While this may 
seem like a mathematical curiosity, it's worth keeping in mind 
that which spacetime you're working in can affect the validity of 
a gauge condition. 



and treat the exponential as additional terms in the ac- 
tion. This is done by introducing fictitious "ghost" fields. 
The full £ going into Eq. (|49p then consists of the orig- 
inal Lagrangian, the ghost Lagrangian and S{f{x)) in 
exponential form. 

A simpler alternative is to use a gauge condition with 
field-independent Mf, so that det(Af/) can be trivially 
factored out of the path integral. When the gauge condi- 
tion is sufficiently simple for the Lagrangian to be writ- 
ten directly in terms of the independent variables, the 
5{f{x)) factor can be explicitly enforced and so does not 
appear as a separate gauge fixing term. A gauge with 
these properties is known as a physical gauge, since it 
only uses physical degrees of freedom. 

The axial gauges Wq = and W^ = are physical 
gauges^ (see [47| for a review). The naively defined uni- 
tary gauge is occasionally claimed to be one too, but since 
it produces gauge bosons with mass terms on the same 
form as those of the classical Maxwcll-Proca Lagrangian 
for massive vector fields, it succumbs to the same prob- 
lem upon quantization: a badly divergent perturbative 
expansion which can not be renormalized and so is use- 
less for (most) practical calculations. From a modern 
perspective, it should come as no surprise that arbitrarily 
cutting out a part of configuration space spoils renormal- 
izability (especially since the path integral runs over all 
field configurations, whether classical solutions or not), 
but the original work was done in the older canonical for- 
malism. Historically I48| , this is why the Weinberg-Salam 
model of electroweak interactions was not taken seriously 
until 't Hooft's proof of renormalizability in (initially a 
special case of) the R^ gauges [4g| [S^ . 

The R^ gauges provide the only way to make sense of 
the unitary gauge at the quantum level that I know of. 
For the Abelian Higgs model, the gauge condition is 



d^^A^ 



igv4>2 = 



(60) 



When it's imposed, the Goldstone boson acquires a mass 
squared C(5^)^- The ^ parameter can take any value from 
(Landau gauge, classically equivalent to the Lorenz 
gauge, featuring massless Goldstone bosons) through 1 
(Feynman gauge, with Goldstone and gauge boson of 
equal mass) to the formal limit ^ ^r oo. The gauge 
boson propagator picks up an extra pole at the same 
mass as the Goldstone boson, so propagating Goldstone 
bosons damp out within a distance ^ l/{gi/) [5l| (es- 
sentially the same cancellation is seen in the axial gauge 
[53|). This gives the figure of speech that gauge bosons 
"eat" Goldstone bosons a much more direct interpreta- 
tion than in the classical theory: they really do! The 



^ The time-axial gauge Wq = is a borderline case: it leaves 
a residual invariance under time-independent gauge transforma- 
tions which is removed by explicitly imposing Gauss' law. Since 
it commutes with the Hamiltonian, this need only be done at one 
point in time, e.g. on the initial conditions. See also Appendix 

El 
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classically obscure transformation of Goldstone kinetic 
terms into gauge mass terms can now be understood as a 
consequence of Goldstone bosons being confined within 
a Compton wavelength of gauge bosons. 

Gauge invariance implies that ^ must drop out from 
any calculation of physical observables, so leaving ^ unas- 
signed and explicitly verifying that it does not enter the 
result provides a powerful error check. To make con- 
tact with the unitary gauge, note that the limit ^ ^ oo 
is equivalent to making the Goldstone bosons infinitely 
massive. Bringing an infinitely massive particle into ex- 
istence requires an infinite amount of energy, so taking 
this limit effectively removes the Goldstone bosons from 
the theory (at least as long as gravity is ignored). In this 
sense, the limit ^ — > oo corresponds to the unitary gauge, 
so the two are identified (a kind of argument known as 
"physicist's mathematics"). But note that this is really 
very different from what we did in the classical theory. 

The next time somebody tells you that "Goldstone 
bosons are just the longitudinal components of a mas- 
sive gauge boson" , consider asking if gauge bosons really 
have infinite mass. That might prove entertaining. 



C. Nielsen identities 

Early work on SSB in QFT focused heavily on the 
effective potential, V{^sc) in Eq. (|57p . and understand- 
ably so: it is relatively easy to compute in a loop ex- 
pansion (equivalent to an expansion in powers of h) and 
lends itself to an intuitive interpretation as the quantum- 
corrected classical potential. In the absence of gauge in- 
teractions, it is also easily proved to be the expectation 
value of the energy density in the lowest energy state sat- 
isfying the constraint that ^sc is spacetime independent 
[53J . Away from its minima, V{^sc) generally has an 
imaginary component, signalling that homogeneous field 
configurations are actually unstable, but this inconsis- 
tency was often ignored. 

Less easily ignored was Jackiw's observation that in 
gauge theories, V{^sc) is gauge dependent [54i]. Physical 
quantities can not depend on the choice of gauge, so ap- 
parently V{^sc) could not be an energy density in gauge 
theories. Not in every gauge anyway. But maybe in a 
special one? 

In [55|, Dolan and Jackiw performed the change to 
polar field coordinates (for the Abelian Higgs model) in- 
side the path integral (without the singularities retained 
in [4l|, so implicitly assuming the absence of vortices), 
rewrote the resulting functional Jacobian as a ghost term 
and argued that the resulting "unitary Lagrangian" is 
"the unique description of the physical dynamics of the 
system from which the gauge degrees of freedom have 
been removed by a functional integration"^. Happily, 



a one-loop calculation of V{^sc) yielded a finite result 
which could be reproduced in the R^ gauges, provided 
that the unitary limit ^ ^ oo was taken before sending 
the momentum cutoff in the loop integral — *■ cxd. 

Of course, V{^sc) is by definition a static quantity, 
not a dynamic one, and if there is any context where the 
unitary gauge might be expected to yield a finite answer, 
despite the exclusion of topologically non-trivial config- 
urations, it should be one involving spacetime indepen- 
dent configurations only. But the "unitary Lagrangian" 
of [55] is not the classical one: the new ghost field term 
arising in the functional measure is now understood to 
be equivalent to a quar tically divergent, non-polynomial 
Higgs self-coupling [5^ . 

Even so, the suggestion that this "unitary Lagrangian" 
enjoys a unique status was quickly put to rest by Nielsen 
|57t and, independently, by Fukuda and Kugo [53|- Work- 
ing in the Fermi gauges 



fiAn = -^(.d,An' 



(61) 



Nielsen showed that gauge invariance implies a simple 
differential equation which relates the dependence of the 
effective potential on ^ and $sc, respectively: 



d_ 



Ci^^'Psc) 



d 



9$., 



V{^,<^s 







(62) 



The function C(^, $sc) can be determined order by or- 
der in the loop expansion. Essentially, this means that a 
variation in ^ is always compensated by one in ^sc, keep- 
ing the value of the effective potential invariant to each 
order of the expansion. The generalization of this equa- 
tion to the non-Abelian case and to successively more 
general subclasses of the R^ gauges was carried out in 
[531 [sol [Sil [64I • The result is now collectively known as 
the Nielsen identities. 

In [58[ , a generalization of this approach was used to 
show that the full effective action (effective potential plus 
derivative terms) is gauge invariant at stationary points 
(so solutions of the effective field equations are gauge 
invariant) to each order of the loop expansion. The ef- 
fective action can be computed in any (workable) gauge, 
the energy density computed at its stationary points is 
gauge invariant, the stationary points with the smallest 
energy densities are the true vacua, and if the fields at 
such a stationary point are not spacetime dependent, it 
follows that their energy density is given by the effective 
potential. The last condition is satisfied by a wide class 
of gauges (dubbed "good gauges" by Fukuda and Kugo) , 
including Coulomb, axial, Fermi and i?j. 

On the other hand, the effective action (and potential) 
is generally not gauge invariant away from its station- 
ary points ("off shell"). As first pointed out by Vilko- 
visky, this is due to the dependence of the couplings 



This may well be how the meme that "the true dynamics is given 



by the unitary gauge" was born. 



between fields and external sources in Eq. p8)) on the 
chosen field parametrization [63] • It is possible to write 
down Nielsen-like identities which enforce invariance un- 
der field reparametrizations and to construct effective 
actions which satisfy them, and which can therefore be 
identified with energy density also off shell [64| • In par- 
ticular, the so-called Vilkovisky-DeWitt effective action 
for YM theory is just the ordinary effective action eval- 
uated in the covariant background field gauge |44|[65|: 
each field is written as the sum W° -I- Q° of a background 
part W^ (neither gauge fixed, coupled to a source or path 
integrated over) and of a quantum part Q°^ (coupled to a 
source and path integrated over as usual) whose covariant 
derivative with respect to W^ is required to vanish: 



SpQ^ + gCabcWb^Q'i = 



(63) 



Like all covariant gauge conditions, Eq. ([55]) requires 
the introduction of ghosts. If on-shell gauge invariance 
is all you need, an axial condition on Q^ will do the job 
without them. 



D. Finite temperature 

When reading old papers from the golden age of QFT, 
roughly mid-60s to mid-70s, it is hard to miss the shift- 
ing view of gauge symmetries and SSB: from neat math- 
ematical tricks allowing the construction of renormaliz- 
ablc theories to physical reality. The watershed event was 
the realization that spontaneously broken symmetries are 
restored at high temperature [63| 6a] 6^. As Weinberg 
put it, "if a gauge symmetry becomes unbroken for suffi- 
ciently high temperature, then it is difficult to doubt its 
reality" [69|]. 

To see how this comes about, consider again the ef- 
fective action. Using the conventionally defined path 
integral of Eq. (|49l) . the proper vertices in Eq. ([56]) 
are the vacuum-to-vacuum expectation values of time- 
ordered field operators: 



r(")(xi,...,x„) = (0,-K(X3lT[$(a;i)...$(x„)]lO,-oo) (64) 

In plain English, they describe sequences of scattering 
events starting in empty space in the infinitely far past 
and ending in empty space in the infinitely far future. 
If the system under study is not empty space, this is 
not an adequate model. For instance, in a heat bath in 
equilibrium at inverse temperature /3, the probability of 
a scattering event between energy eigenstates ]$„) with 
energy En follows the Boltzmann distribution 



-/3-E„ 



Er. 



-pE„ 



(65) 



then use the finite-temperature proper vertices 

I 13 \^li ■■■! ^n) = 

E, e-^""' ($,, +(X3lT [$(xi)...$(x„)] ]$/, -oo) 



E. 



-I3E„ 



(66) 



Given a complete, orthonormal set of \^n)i we should 



or, dropping the orthogonality requirement, the more 
general 

r<".,..,...,.„) = I^2:!lmil^^M ,„, 

where the trace runs over any complete set of states and 
H is the Hamiltonian. By comparison with the usual 
expression for the transition amplitude from initial state 
l$i) to final state ]$/), 

($/le-*-f^l$,) = fv^e-J'o'^^J''^'''^ (68) 

we can therefore handle the (equilibrium) finite tem- 
perature case by performing the formal substitution 
i — > /3 and restricting the functional integration to field 
configurations periodic in /3 (antiperiodic for fermions) 
[z3]|Zli[zi- The generating functional W[J] of Eq. gH) 
then becomes the partition function of statistical me- 
chanics. 

Using this technique to compute the effective poten- 
tial V{^sc) at finite temperature reveals that it picks up 
a positive mass term c>c l//3^. At a sufficiently small /3 
(the inverse critical temperature), this thermal mass term 
becomes dominating and turns ^sc = into the global 
potential minimum. The symmetry is then restored, i.e. 
<& is equally likely to be found anywhere on the zero tem- 
perature vacuum manifold. 

The argument that the "unitary Lagrangian" of [5a] 
enjoys a unique status was dealt its first blow in this 
context (and by its own originators). When the one- 
loop comparison with the R^ gauges was extended to 
finite temperature in [6S] . the critical temperatures did 
not match, and the i?j result was identified as the correct 
one. 

The discrepancy was attributed to the non- 
renormalizability of the unitary gauge, but this 
was counterintuitive: the compactification of the time 
dimension which turns QFT into equilibrium thermal 
field theory replaces loop integrals with sums over 
discrete frequency modes, each described by its own 
three-dimensional theory, and lower dimensional the- 
ories have better high energy behavior than higher 
dimensional ones. Indeed, renormalization at zero 
temperature is always sufficient to remove all infinities, 
so why did problems crop up in the finite, thermal loop 
contributions and not in the zero temperature part? 

This became known as "the unitary gauge puzzle" . 
The key to the solution is the observation that while the 
value of the effective potential is gauge invariant at sta- 
tionary points (in "good gauges" )j_its curvature (used to 
obtain the effective Higgs mass in '6^) is not: to compute 
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the latter necessarily involves going off shell V3^, where 
the identification of V{^sc) with energy density no longer 
holds. Even so, it took several iterations [7J][7^][70| (and 
a couple of decades) to recognize that the correct critical 
temperature can in fact be extracted order by order from 
any self-consistent perturbative expansion, even (!) one 
based on the "unitary Lagrangian" of |55| . 

Needless to say, the high temperature symmetric phase 
is in the null space of the unitary gauge. There is also 
a problem common to all gauges featuring unphysical 
degrees of freedom: the traces of Eq. ([S7)) run over all 
of state space, but there is no reason why ghosts etc. 
should be in thermal equilibrium with the heat bath. The 
starting point for thermal gauge field theory is therefore 
necessarily a physical gauge [73| . 



E. Effective field theories 

The temperature dependence of V{^sc) is a reminder 
that physics is not scale invariant; the world looks and 
acts differently at different energy scales. Fortunately, 
you don't need detailed knowledge of physics at the scales 
of grand unification or quantum gravity to understand 
physics at the electroweak scale, you don't need detailed 
knowledge of physics at the electroweak scale to under- 
stand nuclear physics, and you don't need detailed knowl- 
edge of nuclear physics to understand chemistry. 

In QFT, this independence of lower energy phenom- 
ena from higher energy ones is formally known as the 
Appelquist-Carazzone decoupling theorem [78[ . What it 
says is that massive fields effectively decouple at low en- 
ergy: given a renormalizable Lagrangian containing both 
massless and massive fields, you can describe its low en- 
ergy behaviour with a renormalizable Lagrangian written 
in terms of the massless fields only. The massive fields 
only contribute to the low energy Lagrangian through 
the renormalization of its couplings and fields. 

More generally, you can eliminate heavy fields from a 
Lagrangian which also contains light fields by encoding 
their effects in (generally non-renormalizable) interaction 
terms involving the light fields only. The resulting ef- 
fective field theory (EFT) is valid at energies below the 
masses of the eliminated fields [73] . The whole edifice of 
standard model extensions - technicolor, supersymmetry, 
grand unified theories (GUTs), supergravity and string 
theory with its infinite tower of massive excitations ~ im- 
plicitly depends on the suppression by powers of energy 
over mass of the effective interaction terms induced by 
the high energy extensions. 

In principle, constructing an effective field theory from 
a more fundamental one is straightforward. Take the ef- 
fective action of Eq. ((5^ , Fourier transform it and do all 
momentum integrals involving the heavy fields. Split the 
remaining integrals in two parts, one for momenta go- 
ing up to your cutoff (the mass of the lightest eliminated 
field or less), one for momenta above the cutoff, and do 
the latter too (alternatively, gauge-fix and integrate out 



the heavy fields only, leaving any local invariance specific 
to the light fields unbroken until the need actually arises 
to gauge-fix them too [8^). Transform back, and you 
are left with a non-local action on the form of Eq. (|57p . 
written in terms of the light fields only and valid down 
to distance scales ~ 1/ (momentum cutoff). 

In practice, this "top-down" program may not be pos- 
sible to carry out, even approximately, either because you 
don't know the fundamental Lagrangian or because there 
is no workable approximation scheme. In such cases, you 
may still be able to create an EFT by systematically 
writing down all interaction terms (up to some cutoff 
dimension) involving light fields only and respecting all 
known symmetries. The schoolbook example of this ap- 
proach is chiral perturbation theory of low energy QCD, a 
perturbative expansion in masses and momenta of quark 
bound states, small on the hadron mass scale ^ 1 GeV 
[8l| [84I [831 ■ In this case, the fundamental Lagrangian is 
known, but ordinary perturbation theory breaks down 
at low energy due to the growth of the effective coupling 
(confinement). 

Fortunately, this problem does not occur in Higgsed 
YM theories: small couplings stay small also at low en- 
ergy, and the Appelquist-Carazzone theorem ensures the 
existence of an EFT written in terms of the massless 
fields only (if any). Since corrections to the tree level 
terms are ex powers of couplings over energy, integrat- 
ing out the massive gauge bosons from a weakly coupled 
theory induces negligible effects at energies ^ the sym- 
metry breaking scale. In this low energy/long distance 
limit, the EFT reduces to the classical Lagrangian with 
all massive gauge bosons set to zero^. 

More generally, the classical theory should provide a 
good approximation to the finite temperature dynamics 
of modes with large occupation number, i.e. for fields 
with mass <^ temperature [84,] 3^ [M] • 



F. Non-linear sigma models as EFTs 

Consider the low energy limit of the original U(l) ~ 
0(2) Goldstone model. A look at Eqs. HID-inD is 
enough to tell that the effective Lagrangian must be 



£{9) = -ly^d^ed^e 



(69) 



i.e. that of a free, massless scalar field (the Goldstone 
boson) taking values on a circle in internal space (since 9 
is an angle, so 9 = and 9 = 27r are identified; the circle 
is of course the bottom of the Mexican hat potential). 
There can't be any p particles on shell when the energy 
density is <?C the rest mass of a p per Compton volume. 



^ I emphasize this because I have occasionally noticed some confu- 
sion on this point, especially among cosmologists, who sometimes 
seem to take classical Lagrangians a little too literally. 
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i.e. ~ X^v^, and quantum corrections at energy E can't 
be worse than ^ X^E/v. 

The factor v^ in front of the derivative term is the 
"stiffness" of 9. Make it larger and it costs more energy 
to hft 9 from the ground state (an arbitrarily chosen, 
constant 9 value). Note that massless is not synonymous 
with "cheap" ; when we speak of "low energy EFT" , we 
really mean "low energy density EFT" (just like we really 
mean "Lagrangian density" when we say "Lagrangian" 
in field theory). This is why the derivative term of a 
massless field belongs in a low energy EFT even if it is 
associated with a very large energy scale: unlike a mass 
term, a derivative term has a continuous spectrum (un- 
less Lorentz symmetry is broken, e.g. by a spatially pe- 
riodic potential a la Kronig-Penney) so arbitrarily small 
values are allowed. 

There is more interesting physics in Eq. (j69p than 
meets the eye, all due to the identification of = with 
9 = 27r. As in the Abelian Higgs model, topology guar- 
antees the existence of vortex solutions, but without a 
matching gauge field, the gradient energy of a single vor- 
tex diverges logarithmically with spatial radius. This di- 
vergence can be cured by pairing up a vortex with an anti- 
vortex winding in the opposite direction, so that their far 
fields cancel. The vortices in such a pair attract with a 
logarithmic potential; add more pairs and you can create 
a periodic vortex lattice. A two-dimensional "gas" of vor- 
tices and anti-vortices also features a high temperature 
transition above which thermal disorder wins over pair- 
wise attraction and individual vortices can roam freely 
(Kosterlitz-Thouless transition) . 

Things get even more interesting when you consider 
the generalization of the Goldstone model to higher 0(N) 
groups. The low energy limit is then given by the La- 
grangian 



C{n) = -v'^d^n-d''n 



(70) 



where n is an N-dimensional vector constrained to take 
values on the (N - l)-sphere, i.e. satisfying \n\ — 1. 
Mathematicians call this a wave map, physicists a non- 
linear sigma model (NLSM). 

For N = 3, n is a 3- vector taking values on an ordinary 
unit sphere (a 2-sphere). In terms of standard spherical 
coordinates (^, <p). 



ni = sin(z?) cos(ip) 
n2 = sin(z9) sin((/?) 
713 — cos(?9) 



(71) 
(72) 
(73) 



and the Lagrangian reads 



C{^,^)^\>^' 



[df,i}d''d + sm^{^)d^,ipd''^] (74) 



Add a quartic derivative term and you get either the 
Skyrme model [87[ , a precursor to modern chiral pertur- 
bation theory, or the Skyrme- Faddeev model !,88| , a can- 
didate low energy EFT for pure SU(2) YM theory. The 



extra terms are introduced to defeat Derrick's theorem, 
a simple scaling argument which rules out the existence 
of finite-energy static solutions to scalar theories with 
quadratic gradient terms in more than two spatial dimen- 
sions [83]. On its own, the NLSM can only have time- 
dependent three-dimensional solutions with finite energy. 

Note the non-linear term (the "NL" in NLSM), which 
suggests richer dynamics than in 0(2), but also note that 
the 0(2) case is recovered when either ip ot d is constant. 
This is easily seen to generalize: higher 0(N) models 
embed lower ones. 

The construction of 0(3) NLSM solutions is greatly 
simplified by the stereographic projection to the complex 
plane (extended with a point at infinity to represent the 
north pole) 



after which 



u{^, ip) = tan(i9/2)e"^ 



L[u) = 2u 



(1 + uUif 



(75) 



(76) 



In this form, the model is known as CP^ (Complex Pro- 
jective, one dimension). Writing u = p + iq with real p 
and q, the equations of motion are 



d^d^p + 2p 



df.d'^q + 2q 



d^'qd^.q - df^pdfj^p 
l+p'^ + q^ 

d^pdf^p — d'^qdf^q 
1 + p2 + g2 



= 



= 



(77) 
(78) 



If u is analytic in the [x, y) plane, the Cauchy-Riemann 
equations guarantee that p and q satisfy both the Laplace 
{V^p = 0) and the eikonal ((Vp)^ = 0) equation and 
therefore also the equations of motion in (a;,y). A sep- 
arable factor which satisfies the wave equation along z, 
i.e. u{t ± z), also works, so you can easily build either 
static solutions constant in z (e.g. u = (x + iy)"" , a vortex 
winding n times around the {x, y) plane) or wave pack- 
ets of arbitrary three-dimensional shape moving up and 
down the z axis. 

But the real news relative to the 0(2) case is the exis- 
tence of nontrivial maps from the 2-sphere in field space 
to the 2-sphere at spatial infinity, as in 



ly 



r + z 



r = v .x^ + y^ + z^ 



(79) 
(80) 



(see [90| for more). When n = 1, this is known as a 
hedgehog. In line with Derrick's theorem, its energy is 
proportional to volume, but like the global vortex, it has 
a gauged counterpart with finite energy: the monopole 
first described by 't Hooft and Polyakov ^^. 

The next step up the NLSM ladder is 0(4), i.e. a 4- 
vector taking values on a unit 3-sphere. Its new feature is 
the existence of knot-like maps from the internal 3-sphere 
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to the spatial 2-sphere ( "textures" to cosmologists) . Un- 
like vortices and hedgehogs, those do not have a singu- 
lar core corresponding to ^sc = in the full Goldstone 
model. Derrick's theorem guarantees that they too must 
be unstable: they shrink until their gradient energy den- 
sity becomes too large for the low energy EFT to handle. 
In the full model, the collapse ends with ^^c sliding over 
the top of the potential, allowing the knot to unwind. 
This never happens given suitably "small" initial condi- 
tions, however ^3l- Topologically trivial configurations 
(sometimes referred to as "non-topological textures" ) can 
therefore keep evolving indefinitely. 

Hadron physicists know all about the 0(4) NLSM. 
When the vector components are light mesons (bound 
pairs of up and down quarks), it corresponds to the 
leading order terms of chiral perturbation theory. The 
"sigma" in "sigma model" comes from the nuclear isospin 
singlet which acquires a VEV (the "Higgs particle"); 
the three independent vector components (the Goldstone 
bosons) are the pions. Vortices, hedgehogs and even 
polyhedral maps have all been described in this context, 
and it has been suggested that bubbles of a Disoriented 
Chiral Condensate (DCC) rotated away from its value in 
our QCD vacuum may be possible to create in collider 
experiments ;94] [95] 96] 97] [98] 99] [10^ . 

In three spatial dimensions, the map from internal 3- 
sphere to spatial 2-sphere is the last one with separate 
homotopy classes, so higher 0(N) NLSMs do not add 
qualitatively new kinds of solutions to those found at N 
= 4. This also implies that the 0(4) NLSM can provide a 
reasonable appro ximation to the dynamics of the general 
N > 4 case [m\ . 



IV. COSMOLOGY 

If you subscribe to standard hot big bang cosmology, 
the relevance of high temperature symmetry restoration 
is obvious. Immediately after the bang, the universe was 
filled with a high temperature plasma in near thermal 
equilibrium. As it cooled, V{^sc) gradually changed 
shape, from harmonic to quartic, and ^sc eventually 
rolled or tunnelled from ^sc = to a randomly picked, 
asymmetric minimum^. Given a particle horizon, i.e. a 
finite maximum radius within which light (or anything 
else) could have traveled, this phase transition could not 
be coordinated over all space, so causally disconnected 



'^ This does not imply that the finite temperature efl'ective poten- 
tial provides a good description of the dynamics of the transition 
from symmetric to broken symmetry phase. V{^sc) is computed 
for spacetime independent ^sc and (at finite temperature) using 
the equilibrium partition function. Spacetime independent, equi- 
librium configurations only occur at extrema of the potential, so 
V{^sc) can only be used to determine from which critical tem- 
perature a phase transition is allowed, not to study its dynamics. 
See [loi and [H. 



regions picked vacua independently of each other. 



A. Domains and dark energy 

This mechanism was recognized already by Weinberg 
in his seminal paper on high temperature symmetry 
restoration [69| , which introduced the now familiar anal- 
ogy with domain formation in ferromagnetism and went 
on to ask: "Does the universe consist of domains, in 
which symmetries are broken in equivalent but different 
directions? If so, what happens when a particle or an 
observer travels from one domain to another?" 

The second part of Weinberg's question was quickly 
(but, as it turns out, only partially) answered by A. Ev- 
erett, who considered the fate of light crossin g bet ween 
domains in the standard electroweak model |l03l ] and 
found it to depend on the width of the boundary: "If 
the transition between domains is smooth, with a tran- 
sition region whose thickness is large compared with a 
wavelength of the incident radiation, there is no reflection 
and one observes a transmitted wave identical to the inci- 
dent wave." A thin boundary, on the other hand, would 
result in total reflection. After the discovery in 1998 of a 
luminosit y deficit from high redshift Type la supernovae 
(SNe la) [l0l][l0l[iQi], his description of the interme- 
diate case, a domain boundary of width comparable to 
wavelength, seems almost prophetic: "A distant object 
viewed through such a domain boundary would appear 
less bright than it should. If one knew independently the 
distance and brightness of the object, say by knowing 
its cosmological red-shift and the brightness of objects 
similar to it, then the existence of such semitransparent 
domain walls would be detectable." This was published 
in 1974! 

But there is a problem with Everett's derivation which 
would not have been lost on Feynman: it's a purely 
classical treatment of the electroweak boson sector (lin- 
earized to boot). The decay to fermions of virtual weak 
bosons traversing a wide boundary is not considered. For 
a macroscopic boundary width and an incident photon 
packing enough energy to create at least a neutrino pair, 
the negligible lifetime ^ 10~^^ s of weak bosons (and the 
absence of reflection at tree level) clearly makes this the 
dominating effect. The problem then reduces to project- 
ing the photon (i.e. massless) state in the source domain 
onto the photon state in the destination domain, and the 
residual luminosity for domains related by SU(2 ) param- 
eters uj = [uji,oj2,oj3\ is easily found to be [107| 



eiuj) 



sin^(^^) "^^"^gP+"^% cos^(^H^) (81) 



where w^ 
sin^ 9w - 



UJ2 and Uy/ is the Weinberg angle, 



2 and 
0.2216. One or two electroweak domain 
boundaries between us and high redshift SNe la would 
explain their luminosity deficit without new physics, and 
may also provide the missing energy density required by 
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the apparent flatness of the universe, so obviously every- 
body but me hates the idea. 

Besides desperation for any sign of new physics, why 
did nobody follow up Everett's work in 1998? A possible 
explanation is that it had simply been forgotten^. An- 
other is the popularity of inflation: when the supernova 
news broke, the first thought to cross the mind of most 
physicists probably was "Oh, so it's still going on!" (it 
certainly was my first thought). 

Yet another reason might have been Everett's use of 
the term "domain wall" , which had since become syn- 
onymous with "topologically stable two-dimensional de- 
fect passing through ^sc = 0" . The standard electroweak 
model notoriously fails to satisfy Kibble's gr oup- theoretic 
criteria for the existence of domain walls [l09j |. and so 
must any sensible alternative to it, since a domain wall 
network would quickly and catastrophically collapse to 
a massiv e, single wall dominating its horizon volume 

In hindsight, it may seem strange that transients (i.e. 
not necessarily stable configurations) interpolating be- 
tween different values of ^sc on the vacuum manifold 
(rather than passing through ^^c — 0) were not consid- 
ered in this context. That they existed in the early uni- 
verse, and may have played an important cosmological 
role as seeds of large- scale magnetic fields llSll . w as al- 
ready well understood pl^[TT3|[TT^pl5t|ll6t|ll"7J . But 
by now you should have no problem guessing the (er- 
roneous) argument against their survival to late times: 
transforming such configurations to the unitary gauge 
turns them into collections of weak gauge bosons, which 
are unstable and decay in a microphysical time^". 

One way to see that this can not be right is to rec- 
ognize the implicit assumption that only the Higgs vac- 
uum manifold is degenerate. The transformation to the 
unitary gauge would then leave the gauge bosons with 
only one vacuum to "fall" to, so no coordination across 
space would be needed to complete the relaxation to the 
vacuum. That this assumption is incorrect in the stan- 
dard electroweak model has been elegantly demonstrated 
by Lepora and Kibble, who showed that the vacuum 
manifold of the electroweak gauge bosons is actually a 



* To ans wer th e inevitable question: no, I was not aware of it when 
I wrote llOTII , nor does Penrose seem to have been when he made 
the remarks llOSH which got me thinking about this problem. 

® A related problem is posed by topologically stable monopoles, 
which occur in any gauge theory featuring SSB of a simple group 
to a semisimple group containing a U{1) factor, like the standard 
electroweak group. This was one of the original selling points of 
inflation: if the standard model descends from a simple GUT by 
SSB, the energy density of a single GUT monopole per horizon 
volume at the symmetry breaking transition would overdose the 
universe by many orders of magnitude. Inflation solves the prob- 
lem by diluting monopoles and other defects left over from GUT 
symmetry breaking to acceptable levels. 
^^ Yes, I was once summarily dismissed with this argument by a 
supposedly top journal. 



squashed hypersphere [llg |. The unitary gauge conceals 
this degeneracy in a "spaghetti code" mess of interaction 
terms whi ch I won't even try to reproduce here (see e.g. 
Ch. 14 in pjj l. 

More generally, even disregarding that Goldstone 
bosons are not "just longitudinal components of massive 
gauge bosons" in a consistently quantized theory, it is 
obvious from the vantage point of a physical gauge that 
the argument must be wrong, since it assigns a privileged 
role to an arbitrarily chosen point on the vacuum mani- 
fold. It is isomorphic to somebody in New York claiming 
that people in London are not standing vertically and so 
must all be falling over: after all, they are not aligned 
with the Empire State Building'^-'^! 

Since Earth's (idealized) surface and the vacuum man- 
ifold arc both Riemannian and symmetric, it is of course 
always possible to connect any two points on either one 
by a curve featuring a continuous sequence of tangent 
spaces, each having equivalent but different definitions 
of "vertical" (or of "gauge boson X"). Your ability to 
stand vertically at your current location is sufficient to 
know that you could create a chain of people connecting 
any two cities on Earth, all standing vertically accord- 
ing to their own, local definition. Likewise, the existence 
of a stable gauge boson in a single vacuum is sufficient 
to know that you could create interpolating configura- 
tions between any two points on the vacuum manifold 
using only stable gauge bosons. Massless gauge bosons 
are necessarily stable (below some fermion pair produc- 
tion threshold pJ0iil21,]), so if your theory has them, you 
are all set. Nambu gave us a convenient sufficient crite- 
rion for their existence: the broken symmetry group must 
be semisimple (as in the standard electroweak model) or 
have rank > 2 f21'|. In other words, the microphysical 
decay argument fails in any realistic theory. 

Microphysical stability does not imply classical stabil- 
ity of macroscopic configurations, of course. What it 
does imply is that the realignment of ^sc after the initial, 
random choice of vacua in causally disconnected regions 
must be resolved according to the effective field equa- 
tions. In a cosmological model with finite particle hori- 
zons, this is enough to guarantee the existence of elec- 
troweak (and maybe other) domains. Their boundaries, 
whether classically stable or not, might be possible to in- 
fiate beyond the current horizon with a late period of ex- 
ponential expansion, but that would require new physics 
at the electroweak scale (or lower) and risk diluting away 
any net baryon number created at the electroweak phase 
transition. 



^^ An esteemed editor at another top journal recently informed me 
that the unitary gauge can always be applied using partially 
overlapping patches. This is isomorphic to claiming that the 
existence of a complete world atlas made of partially overlapping, 
flat charts proves that Earth is flat. Yes, flat-earthers are funny. 
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B. Global and local textures 



In the early 90s, textures, topological [122l | and not 

[123| |. were considered serious candidate sources of the 
primordial density perturbations which seeded the large 
scale structure of the universe. More recently, it has been 
suggested that the large anomalous cold spot found by 
WMAP in the cosmic microwave background (CMB) is 
due to a texture which originated at a symmetry breaking 
scale 8.7 x 10^^ GeV (intriguingly clos e to t he hypotheti- 
cal GUT symmetry scale ~ 10^'^ GeV [l23 |) and eventu- 
ally colla psed at redshift z ^ 6, a billion years after the 
big bang [l25|. 

The textures in question have all been global, of the 
0(N > 3) Goldstone model -> NLSM variety. This runs 
counter to the commonly held view that continuous sym- 
metries should be local (i.e. gauged), and it runs into se- 
rious difficulties with quantum gravity (string theory in 
particular does not seem to allo w any global continuous 
symmetries; see e.g. p. 255 in [126|). The reason for 
nevertheless sticking with global textures is the following 
argument, due to Turok 123 |: 

A scalar field <& constrained to the vacuum manifold 
can be written 



^{x) = U{x)'^o 



(82) 



where U{x) is a symmetry transformation and $o is an 
arbitrarily chosen reference point on the manifold. If $ is 
gauged, with covariant derivative D^ given by Eq. 
the gauge field can "fall" to 



W^ = - [d^U) u 
9 



(83) 



at every point in space. This makes 1)^$ = 0, so the gra- 
dient energy of <1> is zero. Since W^ is a pure gauge con- 
figuration, its energy also vanishes and the configuration 
stops evolving. Naively, this relaxation process should 
complete on the time scale of the gauge interaction, i.e. 
in a microphysical time, so any gauged texture will be 
gone long before it can have observable consequences. 

It is not hard to see where this argument came from. If 
we start from the trivial vacuum <^{x) — $o, W^{x) — 
and apply the gauge transformation U{x) according to 
Eqs. dMl) and dSSl), we obtain Eqs. dM]) and dM]). A 
gauge transformation can not affect physical observables, 
so the energy density must remain zero. 

In other words, U{x) is just the inverse of the transfor- 
mation used to impose the unitary gauge, so this is just 
the usual microphysical decay argument in light disguise. 
How does it go wrong? Let me count the ways. 

One: As often pointed out by Vachaspati, there is no 
reason why $ should have vanishing gradient energy at 
finite temperature. By dimensionality and equipartition 
of energy alone, given an inverse temperature (3 we should 
expect D^$ - I//32. 

Two: At first sight it would seem that this argument 
must work for any field configuration. But you already 



know that domain walls, Nielsen-Olesen vortices and 't 
Hooft-Polyakov monopoles are stable. Clearly, the argu- 
ment fails for them. Why? An easily seen reason is that 
they have cores where <^{x) — 0, which can not be writ- 
ten according to Eq. ([82]) (i.e. they are in the null space 
of the unitary gauge). Try setting t/ = and the C/~^ 
in Eq. ([55]) blows up. Stated another way, U fails to be 
unitary at the core of such a defect: C/^ 7^ U~^ . This 
clearly does not apply to textures, which have no singu- 
lar core, but topological textures nevertheless owe their 
existence to the existence of maps in different homotopy 
classes. Transformations from one homotopy class to an- 
other are by definition discontinuous, so even if the U~^ 
in Eq. (|55|) won't blow up, the dfJJ will. Gauging the 
theory lets you move the discontinuity from <f> to W)^, 
but does not eliminate it. 

Three: Even topologically trivial $ configurations 
carry conserved quantities (energy, momentum, gauge 
currents, all in derivative terms) which must go else- 
where, i.e. to fermions, upon relaxation. Fermions can 
only be produced effectively while the energy and charges 
within the Compton volume of a fcrmion pair (e.g. elec- 
tron + anti-neutrino for electroweak interactions) are > 
the total mass and charges of such a pair (on shell). 
Once $ gradients fall below this threshold, dissipation 
to fermions becomes exponentially suppressed. The evo- 
lution of $ and W^ then becomes a Hamiltonian flow, 
but need not stop. 

Combine the first and third point and you can estimate 
the initial size beyond which charged $ configurations 
should have been safe from dissipation: the horizon scale 
when (3 crossed above the Compton length of the lightest 
charged fermion. For electroweak interactions, that's ^ 
10'^ km |107| . If such configurations then simply tracked 
overall metric expansion, their corresponding minimum 
size today would be r^ one lightyear. Apply the same 
redshift to their energy density and you inevitably land 
right at the current "dark energy" scale, 10"'^ eV . 

Four: As Nambu pointed out long ago, D^^ — 
does not guarantee vanishing energy when there are lin- 
ear combinations of commuting generators which annihi- 
late <&, i.e. when massless gauge fields remain after SSB 

For a heuristic understanding of this point, consider 
the standard electroweak model, which conveniently has 
the same scalar sector as the 0(4) Goldstone model: set- 
ting the photon Ap^(x) = along with the massive gauge 
bosons and applying some U{x) will certainly yield a 
gauge-equivalent vacuum according to Eqs. (|82p and 
([55)1 . But thanks to the absence of a mass term, any 
finite, constant A^ is an equally valid vacuum. Starting 
from such a configuration, i.e. from W^ ex 1 -|- ra (see 
[lOTJ 'l Eq. ([55)1 picks up an additional term not contained 
in Eq. (j83p . The common term does not depend on the 
gauge fields, so evidently the same U{x) yields different 
vacua for different choices of initial constant A^j_ . This is 
a simple exam ple of the degeneracy of the gauge sector's 
vacuum [118J . The implication is that just as for the 
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scalars, an independent choice of vacuum at each point 
in space will not generally result in a global energy mini- 
mum. That requires coordination, so the usual causality 
bound (the horizon) applies. 

In a slightly more mathematical language, for any 
"good gauge" (in the sense of Fukuda and Kugo [Sa]) the 
lowest energy configurations are spacetime independent, 
so the energy density in a sufhciently small neighbor- 
hood of such a minimum (in field space) can be approxi- 
mated by an ordinary Taylor expansion. Since there are 
no linear terms, the first non-trivial term is the Hessian 
matrix of second order derivatives in the fields, i.e. the 
mass matrix. For the minimum to be non-degenerate, 
the discriminant (the determinant of the Hessian) must 
be positive definite. If there is at least one massless field, 
this condition is not satisfied: the minimum is degenerate 
and SSB ensues. 

This can be seen explicitly by working out the low 
energy EF T of the standard electroweak model in an ax- 
ial gauge jl27l |. You can start from scratch or you can 
get a head start by using the gauged NLSM (GNLSM), 
valid for energies <C the Higgs mass (now known to 
be > 100 GeT^), which was written down long ago 
to paranietriz e the unknown syrn metry breaking sector 
Hi [Hildas (see Ch. 2 in [ml for a review). Choos- 



ing the second route, the Goldstone field matrix in polar 
field coordinates 9a is 



the interaction energies under consideration). We there- 
fore need to isolate the linear combination of gauge fields 
with zero mass, i.e. the photon, as defined in an arbitrary 
vacuum — [6'i, 6*2, 6*3]. 

In the basis [S^, IV^, IV^, IV^] , the Cgnlsm terms 
quadratic in B^ and W^ give rise to the mass matrix 





r 5l 


gsg-wQi 


gBgw'S>2 


-gngw^a 


y' 


gsgw^i 


9w 








2 


gBgw'^2 





9w 







. -gsgw^a 








g'w 



where we have introduced the convenient auxiliary quan- 
tities 



61 


= [6'i6'3(cos(6i) -1)+ 602 sm{9)] /O^ 


(89) 


62 


= [6'26'3(cos(6i)-l)-6i6'isin(6i)]/6|2 


(90) 


63 


= '{el + 9i)cos{e) + el]/9^ 


(91) 



satisfying (9i) + (82) + (O3) = 1. The eigenvalues 
of Eq. ((88|l are the tree level masses squared of pho- 
ton, W^ and Z°. The two degenerate eigenstates can be 
orthogonalized to obtain^^ 



.r„/2 _ 



cos(6l/2) 4- i -^ sm{9/2) (84) 



A^ oc 


[gw/gs, -ei,-Q2, 03] 


(92) 


w;, cc 


[0, -62, 61, 0] 


(93) 


Wl cc 


[0,6163, 6263, e?-i-e2] 


(94) 


Zfj. oc 


[-gs/gw, -01,-02,03] 


(95) 



with 9 = 



hf + {92? + (^3)', and 



0"t cj,+ 



-^■^ 



The covariant derivative acting on S is 



D^ = d^ 



■gw. 



l^-^W^T°- -i-^B„T^ 



.gs 
2 ^^' 



(85) 



(86) 



and the full Lagrangian is 



-GNLSM 



-\b^,b^'' - \w;,w''^^'' 



V 



-Tr 



{D^X? (DX) 



(87) 



You can easily convince yourself that it is equivalent to 
the standard electroweak boson Lagrangian with the ra- 
dial Higgs degree of freedom clamped to its VEV (just 
write out all terms explicitly and compare). It has been 
used as is to compute the one-loop thermal effective ac- 
tion for an electroweak plasma at temperatures below 
the mass of a (hea vy) Higgs and above that of the weak 
gauge bosons J132J |. but we are interested in the real low 
energy limit, where the massive gauge bosons can not be 
excited either and must be set to zero along with radial 
Higgs excitations (so there is no assumption about the 
Higgs mass here, other than that it is much larger than 



To eliminate the massive bosons, invert Eqs. (|92 |) - ((95|) 
and set W^ — W^ — Z^j, — Q (note that given orthogo- 
nal eigenstates, inversion amounts to normalizing them, 
assembling them in a column matrix and transposing). 
The result is 



Bf_, = Afj^cos{ 



'w 



Wl = -A^0ism{9w) 
Wl = -Af,02s\Yi{9w) 

w) 



Wl = A^63sin 



(96) 
(97) 
(98) 
(99) 



To obtain the low energy EFT of the electroweak boson 
sector to leading order, substitute Eqs. ([M)) - ([M)l into 
Cgnlsm and find 



C 



EFT — 






^^,9^^9 



4sin2(6l/2) 
02 



X d,,. 



-\{d^,A, - d^A^f 



^w) 



[A^d^Oa - A^d^OaY 



(100) 



^^ Note that the scalar product of Eq. II92I I (properly normalized) 
for e = and § = Co is Eq. lISTI l. 
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The first row in Eq. (fTOO| is just the plain 0(4) NLSM in 
polar field coordinates, the second row is the Maxwell La- 
grangian, the third row couples them ex sin {9w), acting 
as an effective photon mass term when 9 is not constant. 
Everett's classical result is now easily recovered by noting 
that a photon can penetrate a region of varying 6 only if 
its total energy exceeds its effective mass, which is c>c the 
9 gradient. 

For any constant 9, Left reduces to plain electro- 
magnetism, as it must^'^. Note that v ~ 246.3 GeV^ 
makes the electroweak vacuum very stiff; you would need 
~ 10'*^ joule to "melt" a cubic centimeter of it and cre- 
ate the electroweak equivalent of a DCC. In cold war 
terms, that's ~ 10^^ megatons of TNT, enough to com- 
pletely vaporize Earth, journals included, a couple billion 
times. Being immersed in our vacuum, such a configu- 
ration would also immediately snap back to it (unless 
dynamically stable, which might be theoretically possi- 
ble fl4|). It is only on astronomical scales that random 
9 gradients can be shallow enough to be long-lived. 

Specializing to the time-axial gauge^'* Wq = 0, the 
electric and magnetic fields are 

E = -daA (101) 

B = V -x A (102) 

Varying Left in ^o yields the Gauss constraint 

V ■ E = do(FHA-V9 (103) 

(a plane in A space) where the matrix H has components 



where G, with components 



Hab = sin 



yw) 



d9a d9b 



(104) 



This completely fixes the gauge, leaving A with only two 
independent degrees of freedom. The energy density be- 
comes a manifestly non- negative sum of quadratic forms, 



i{E^- + S') 



p = -\E' + B' 



1 



oT L,2 



do9' {i^'G + A^H]do 



~^dm9 Gdm9 
-\ (e,kiAkdi9^ H[e,^^A^d,,9] (105) 



^^ A truly stunning objection raised against Ceft is that it is less 
symmetric than the full electroweak Lagrangian, so "important 
terms are missing". Yes, Virginia, the low energy EFT has less 
symmetry than the full theory! It's called SSB, you may have 
heard of it. 

^* Even if you are uncomfortable with the time-axial gauge at the 
quantum level, this is not a problem after the EFT has been 
obtained using e.g. the covariant background field gauge. In 
the present case this is a non-issue, since we are staying at tree 
level and disregarding quantum corrections as negligible at low 
energy. 



trafc 



5ad5b 



1 — cos 9 \ Vdtj 

Tit ^ cda^ ceb 



(106) 



is the 3-sphere metric, with eigenvalues 1/4 and (doubly 
degenerate) < (1 - cos(6'))/(26'2) < 1/4. H also has 

no negative eigenvalues (but zeros along all axes of 9 
space) ^^, so by the spectral theorem, A ^ can only 
increase p for a given 9. 

Since the low energy EFT of electroweak interactions 
(valid whether the minimal model is the correct UV com- 
pletion or not) contains the 0(4) NLSM, any solution 
obtained in the latter, textures included, is also a valid 
electroweak solution on macroscopic scales. It should be 
obvious that this result generalizes to larger symmetry 
groups. In particular, any viable GUT must embed the 
standard model and so must contain at least nine mass- 
less gauge fields (eight gluons and a photon, but there 
may be more; "hidden" sectors which only couple to the 
standard model fields via gravity are commonplace e.g. 
in string phenomenology). 

Usual unitary gauge shenanigans aside, the one intelli- 
gent concern which h as be en raised in this context, echo- 
ing the discussion in [127], is that the coupling between 
photon and 9 (the third row in Eq. pOOp ) may signif- 
icantly affect the dynamics of electroweak textures, in- 
validating numerical simulation results obtained in the 
plain 0(4) NLSM. While only simulations based on the 
full EFT will tell for sure, it should be noted that GUTs 
tend to have complicated Higgs sectors (even the original, 
minimal SU(5) model needs two Higgs multiplets, one 
in the adjoint to obtain the standard model group, one 
in the fundamental representation to break electroweak 
symmetry) and that the NLSM of a GUT could have vi- 
able subgroups which do not couple directly to the pho- 
ton. To the extent that the 0(4) NLSM is a passable 
representative of the general N > 4 case, it should also 
be a reasonable approximation of the dynamics of tex- 
tures arising in such subgroups. 

Finally, I can't resist pointing out that the kind of en- 
ergy released by the collapse of an electroweak texture 
could conceivably match that of even the largest gamma 
ray bursts (GRBs), ~ lO**^ joule (equivalent to one cu- 
bic meter of electroweak scale plasma). That may be 
something worth thinking about when confronted with 
"shots in the dark" not easily accomodated by the stan- 
dard massive stellar collapse model, like GRB 070125, 
and one more reason to undertake dynamic simulations 
of Ceft- 



^^ Note the emergence of two natural metrics in field space, one 
(Gab) associated with the scal ar sector and one {Hat) with the 



gauge sector, in line with [11 
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V. CONCLUSION 

The unitary "gauge" is strictly speaking not a gauge, 
but an unphysical constraint which excludes an impor- 
tant part of configuration space and conceals the full sym- 
metry of the theory. It is oblivious to non-perturbative 
solutions like sphalerons and topological defects, even at 
the classical level, and is undefined in the high temper- 
ature symmetric phase. Upon quantization, it yields a 
non-renormalizable and therefore mostly useless pertur- 
bative expansion. Identifying it with the singular limit 
^ -^- oo of the R^ gauges, equivalent to making Gold- 
stone bosons infinitely massive, saves renormalizability. 



but only at the cost of introducing unphysical degrees 
of freedom which can not be assumed to be in thermal 
equilibrium with physical ones. 

All these difficulties can be avoided by adopting a phys- 
ical gauge, as is routinely done in non-perturbative nu- 
merical simulations. Carrying out the effective field the- 
ory program in such a gauge makes the transition from 
symmetric high temperature phase to broken low temper- 
ature phase explicit and suggests that several high profile 
problems in cosmology may have a common denominator: 
the overly simplified picture of gauge symmetry breaking 
painted by the unitary gauge. 
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APPENDIX A: OFF THE BEATEN PATH 

This review presents the mainstream, semiclassical pic- 
ture of the Higgs mechanism: at low energy, the massless 
modes of the classical Lagrangian are believed to provide 
a good first approximation to the non-perturbative back- 
ground, on top of which minor quantum corrections can 
be computed. There are other views. 

As I write this, no Higgs particle has ever been de- 
tected. Even if something like the standard electroweak 
Higgs boson is eventually found, you could still take the 
stance, sometimes advocated by Veltman, that the per- 
turbative expansion defines the theory, while the elec- 
troweak Lagrangian and path integral are nothing more 
than convenient "bookkeeping devices" used to aid in 
the construction and evaluation of Feynman diagrams 
[l33j |. If this is the case, there is no such thing as a non- 
perturbative electroweak sector. 

The strongest reason not to believe this is the success 
of lattice QCD. The lattice is essentially a discretization 
of the full path integral; QCD is a YM theory, like the 
standard electroweak model; and on the lattice, QCD re- 
produces the light hadron mass spectrum to better than 
1% [1341] . This is a result from deep non-perturbative ter- 
ritory which no summation of Feynman diagrams could 
ever reproduce. If the path integral works so well beyond 
perturbation theory for QCD, why should it not work be- 
yond perturbation theory for electroweak interactions? 

But what the lattice giveth, the lattice can taketh 
away. 

There is an argument, based on lattice regularization, 
by which no quantity with vanishing mean value on its 
gauge orbit can have a non-zero VEV. While t he or iginal 
demonstration, known as "Elitzur's theorem" [l35J |. falls 
well short of what mathematicians would call a theorem 
(it actually shows that a compact Abelian gauge field 
minimally coupled to an 0(2) sigma model can not have 
a finite VEV when the lattice is taken to the continuum 
limit), the argument is commonly believed to generalize 



The main thrust is (almost) obvious by inspection: ex- 
plicitly, the path integral expression for the expectation 
value (Q) of some ^-dependent quantity Q{^) is 



The exponentiated action is not affected by symmetry 
transformations, so for each orbit around field space (e.g. 
the bottom of the Mexican hat potential) it can be fac- 
tored out of the integral, reducing the integrand to 



Q($)e' 



(J*) 



(A2) 



For vanishing external source (J = 0), if Q{^) averages 
to over an orbit, the integral over that orbit will vanish; 
if this holds for all orbits, (Q) must also vanish. The hard 
part is proving that (Q) — > for finite J — > 0, which is 
not immediately obvious since the path integral runs over 
an infinite-dimensional space'^^. 

Putting the path integral on the lattice turns it into a 
finite number of nested, ordinary integrals, one for each 
lattice site and field. Taking the continuum limit then re- 
veals that the argument fails for global symmetries (i.e. 
(Q) can remain finite for arbitrarily small, finite J), but 
apparently does hold for local symmetries'^. In particu- 
lar, this implies that the vacuum expectation value of a 
Higgs field must vanish: ^sc = {^) = 0. 

What should we make of this? Elitzur's own assess- 
ment was clear: for SSB to occur, the gauge symmetry 
must be explicitly broken by imposing a gauge condi- 
tion which leaves some global symmetry unbroken (ei- 
ther the gauge symmetry restricted to spacetime inde- 
pendent transformations or some other symmetry). It is 
the global symmetry that is spontaneously broken. This 
is in line with the standard (well informed) interpreta- 
tion of the Higgs mechanism, and with the classical view 
that the theory is undefined until the gauge is completely 
fixed. 

Remember the "good gauges" of Fukuda and Kugo 
j5a |: only after a gauge condition is imposed can you 
know whether a given field configuration is a vacuum. 
The usual textbook presentation does things the other 
way around; first a "vacuum" is picked, then the uni- 
tary gauge is imposed. The validity of this procedure 
can only be verified a posteriori, and as we have seen, it 
does not pass the check at the non-perturbative level. In- 
deed, the unitary gauge and the R^ gauges (for generic £_) 
break the gauge symmetry both locally and globally, un- 
like most common gauges. Neither is therefore adequate 
for studying SSB (as opposed to studying perturbations 



{Q)=JV fv<^Q{^)i 



,i{£+./*) 



(Al) 



^^ From a cosmological perspective, nitpicking about finite "exter- 
nal" sources may strike you as nonsensical until you realize that 
as far as you are concerned, everything now entering your particle 
horizon is an external source, quantum and thermal fiuctuations 
about a vanishing expectation value included. 

^'^ A major loophole is that the continuum limit is unlikely to exist 
for theories which are not asymptotically free. Theories with a 
U(l) product group, like the standard electroweak model, are 
not; that's one reason to view them as low energy EFTs derived 
from something more fundamental. If the underlying theory is a 
simple GUT, the continuum limit may exist. If the underlying 
theory is something else, maybe involving a fundamental length 
scale, all bets are off. 
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on a given, constant Higgs background, a.k.a. particle 
physics). 

So far so good. But what if you impose a gauge con- 
dition and ($) still vanishes? 

In |137l | it was argued that the residual invariance 
of the time-axial gauge under time-independent gauge 
transformations implies (for the Abelian Higgs model, 
in the canonical operator formalism) ($) = 0. This re- 
sult was obtained for "physical" states of infinite norm, 
which may give you some pause (are field expectation val- 
ues well-defined?), but the s ame c onclusion was reached 
again by a different route in [l38|, for arbitrary symme- 
try groups, using the lattice-regularized path integral (in 
Euclidean space, subject to the usual provisos about the 
validity of analytic continuation to Minkowski spacetime 
and the existence of a continuum limit) ^^. 

If $ were a physical observable, this would mean 
that repeated independent measurements must yield ran- 
dom values with average and variance ($^$), and we 
would have to conclude that $ is in a superposition state 
smeared out symmetrically over each gauge orbit (an idea 
first floated in }139j ). But since $ is an unobservable, 
gauge dependent quantity, all we can say (if we believe 
this result) is that ($) (and (<I>^<I>)) is not a good order 
parameter in the time-axial gauge, i.e. that it is not use- 
ful for the determination of the pres ence and nature of 
SSB in that particular gauge (in 139], it was argued that 
(<!') is a good order parameter in the Landau gauge only). 

While the vanishing of ($) in the time-axial gauge may 
thus be nothing more than a quirk of that (incomplete) 
gauge condition, in [138.] it was conjectured to be a phys- 
ical disorder effect caused by a "gas" of instantons (and 
maybe other topologically non-trivial configurations) of 
microphysical size, with average separation on the order 
of the inverse Higgs mass. Elitzur notwithstanding, the 
gauge symmetry would then also remain unbroken under 
global transformations. 

In the electroweak context, there is an obvious prob- 
lem with this picture: contrary to observation, nei- 
ther photons (as ordinarily defined) nor left-handed 
fermions would propagate freely on everyday distance 
scales, which are ^ the inverse Higgs mass (right-handed 
fermions would, since they are weak isospin singlets). 
The only way out of this dilemma is another conjec- 
ture, first publicized in ,140.] but attributed to Susskind: 
all fields which are not singlets under weak isospin, i.e. 
the SU(2) subgroup of the standard electroweak group 
U{1) X SU{2), are confined to singlet bound states, like 
quarks and gluons under color SU(3) of QCD. What we 
see as freely propagating particles are either such singlet 
bound states or fundamental singlets. In particular, the 
left-handed fermions observable at low energy all consist 



of a fundamental left-handed fermion bound to a funda- 
mental Higgs boson. The observable "Higgs particle" is 
actually a bound pair of fundamental Higgs -I- anti- Higgs. 

The confinement conjecture was, once again, moti- 
vated by lattice results. When simple models with Higgs 
fields in the fundamental representation were put on the 
lattice and their parameters were varied (in the "frozen 
Higgs", i.e. GNLSM approximation), no indication was 
found of a phase boundary (discontinuities in physi- 
cal quantities) separating the confinement regime (small 
Higgs VEV, large gauge coupling) and t he H iggs regime 
(large Higgs VEV, small gauge coupling) [l4l| . No quali- 
tative difference therefore seems to exist between the two 
regimes, in the same sense that no qualitative difference 
exists between water vapor and liquid water^^. But there 
are s ignificant quantitative differences, first detailed in 
[l45J . 

For the confinement conjecture to work, the non- 
Abelian interaction must be strongly coupling. Electro- 
magnetism is not strongly coupling, so there can be no 
significant weak gauge boson contribution to the photon, 
i.e. no significant mixing between U(l) and SU(2) gauge 
bosons. The electric charges of observable particles are 
then just their U(l) hypercharges, and the photon does 
not interact significantly with the SU(2) instanton gas. 
Since the three massive gauge bosons are an almost pure 
SU(2) triplet, they must have almost identical masses, 
significantly larger than the standard ones due to the 
strong SU(2) coupling (~ 125 GeV). At low energy, the 
particle spectrum is the same as in the standard, weakly 
coupled electroweak model, but the composite nature of 
quarks and leptons starts showing up around 80 GeV, the 
mass of the standard W^ bosons. At higher energies, 
there is a complicated spectrum of hadron-like bound 
states. 

This scenario was all but killed in 1983 by the ex- 
perimental detection of the W^ bosons at 80 GeV and 
of the Z" boson at 91 GeV, as predicted by the stan- 
dard, weakly coupled electroweak model. Subsequently, 
precision electroweak measurements have shown no sign 
of composite structure up to energies well in excess of 
100 GeV, also ruling out attempts to save electroweak 
confinement through additional assumptions about its 
non-perturbative dynamics [149] . 

Backtracking, the demise of the electroweak confine- 
ment conjecture reduces the instanton gas suggestion of 
[l38j ] to an interesting thought experiment. It may still 
be relevant to other theories, but not to the only exam- 
ple of the Higgs mechanism actually known (?) to be 



^^ Keep in mind that (<&) = does not imply <& = 0. Quantum 
fluctuations alone guarantee that (<l?t<J)) can not vanish, even in 
the symmetric phase. 



^^ Subsequent work showed that this property, dubbed "comple- 
mentarity", can be lost w hen the radial Higgs degree of free- 
dom is al lowed to fluctuate [142l | and when fermions are included 
[l43l | [l44J . These days, invoking complementarity therefore re- 
quires additional assumptions about unknown non-perturbative 
dynamics, in particular the absence of spontaneous chiral sym- 
metry breaking. 
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realized in nature. 

What's left? Out on the fringes of the arXiv, you 
can still occasionally see (presumably well-meaning) ar- 
guments that SSB is impossible in QFT on fundamen- 
tal quantum mechanical grounds, because a symmetric 
Hamiltonian can not produce an asymmetric state start- 
ing from a symmetric one. By the same logic, it is clearly 
impossible to find out whether Schrodinger's cat is dead 
or alive, or to measure the Jx angular momentum com- 
ponent of an electron known to have J^ = 1/2. You 
may also be told that a wave function prepared on one 
side of a double well potential will evolve to a symmetric 
shape, and that by analogy, classically stable topological 
defects can unwind without passing through energetically 
forbidden configurations, courtesy of quantum tunneling. 
If that happens, inquire about the rate of such events in 
realistic field theories, then compare the answer with the 
age of the universe. 

Yet another argument which you may come across is 
that the universe can not be likened to open thermody- 
namic systems, where SSB demonstrably occurs, because 
unlike such systems it is isolated and thus not subject to 
random external disturbances. I don't know about the 
universe of those who write such things, but the universe 
which I can see around me is a ball which has been grow- 
ing at the speed of light for the past 14 billion years (give 
or take a few), and for all that time, its boundary in ev- 
ery direction has been sampling a heat bath hot enough 
to melt the electroweak vacuum, and who knows what 
else. 

That works for me. 



APPENDIX B: BOOK TIPS 

Georgi once started off a nice review of effective field 
theories [T^l with the following piece of advice about old 
literature on almost any subject: "Ignore it! With rare 
exceptions, old papers are difficult to read because the 
issues have changed over the years." I respectfully dis- 
agree. There is often much more to the original papers 
than what you'll find in streamlined textbook presen- 
tations. Read them, and you may discover hidden or 
forgotten gems. 

But textbooks obviously have their place, and there 



are some which you may find particularly useful if field 
theory is not your day job. Since I am an old guy, I know 
mostly old books. Don't worry though. You could have 
slept through the past quarter century without missing 
much if any new particle theory worth knowing about. 
I f you are going to get only one book, get Cheng & 



Li [147[. It starts with basic QFT, thoroughly works 



through the whole standard model and does not stop be- 
fore it has also told you about technicolor and SU(5). The 
introductions to SSB and to path integral quantization 
of gauge theories are excellent. Journal editors should 
read them religiously at least once every full moon. 

The most accessible QFT textbook I know of remains 
Ryder [148|. Among other things, it features the best 
introduction to non-perturbative aspects. 

Another excellent QFT textbook which also covers the 
stan dard electroweak model and SU(5) is Bailin & Love 
[l49J |. Unusually for a textbook, it introduces finite tem- 
perature field theory and high temperature symmetry 
restoration. It is also a good place to start learning about 
anomalies, which you need to understand if you want to 
understand the role of sphalerons in baryogenesis. 

More advanced discussions of path integral quantiza- 
tion, SSB and fi eld t heory at finite temperature can be 
found in Rivers jl50l |. 

For an introduction to supersymmctry and string the- 
ory, or just for something more recent, consider Dine 
[l5l| . but understand that it's a very compressed pre- 
sentation of essentially all fundamental physics, the kind 
of book best enjoyed when you already know the subject. 
If you don't, see it as a starting point for your literature 
searches. 

Hat off to Dine for being secure enough in his physicist- 
hood to openly admit that he finds symmetry breaking 
"a puzzling notion in gauge theories" (p. 17). 
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